Overview
Brought to you by YData
Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 273074 |
| Missing cells | 567713 |
| Missing cells (%) | 7.2% |
| Total size in memory | 58.6 MiB |
| Average record size in memory | 225.0 B |
Variable types
| Text | 17 |
|---|---|
| Numeric | 11 |
| Boolean | 1 |
CITY has constant value "New York City" | Constant |
STATE has constant value "New York" | Constant |
COUNTRY has constant value "United States" | Constant |
IS_BAD_USER is highly imbalanced (99.9%) | Imbalance |
SUMMARY has 26668 (9.8%) missing values | Missing |
EXPERTISE has 95382 (34.9%) missing values | Missing |
CURRENTINDUSTRY has 3625 (1.3%) missing values | Missing |
PUB_NAME has 8047 (2.9%) missing values | Missing |
PUB_DATE has 57936 (21.2%) missing values | Missing |
PUB_DESCRIPTION has 149852 (54.9%) missing values | Missing |
AWARD_NAME has 6483 (2.4%) missing values | Missing |
AWARD_COMPANY has 31679 (11.6%) missing values | Missing |
AWARD_DESCRIPTION has 141832 (51.9%) missing values | Missing |
AWARD_DATE has 46076 (16.9%) missing values | Missing |
F_PROB has 26125 (9.6%) zeros | Zeros |
M_PROB has 27626 (10.1%) zeros | Zeros |
WHITE_PROB has 6465 (2.4%) zeros | Zeros |
BLACK_PROB has 40611 (14.9%) zeros | Zeros |
API_PROB has 26816 (9.8%) zeros | Zeros |
HISPANIC_PROB has 30085 (11.0%) zeros | Zeros |
NATIVE_PROB has 97216 (35.6%) zeros | Zeros |
MULTIPLE_PROB has 59816 (21.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-09-30 07:06:15.566334 |
|---|---|
| Analysis finished | 2025-09-30 07:06:22.251853 |
| Duration | 6.69 seconds |
| Software version | ydata-profiling vv4.17.0 |
| Download configuration | config.json |
Variables
TITLE
Text
| Distinct | 4596 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 133 |
| Missing (%) | < 0.1% |
| Memory size | 2.1 MiB |
Length
| Max length | 235 |
|---|---|
| Median length | 199 |
| Mean length | 82.19057599 |
| Min length | 1 |
Unique
| Unique | 2116 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Chief of Chime Enterprise | Entrepreneur, Businessman |
|---|---|
| 2nd row | Chief of Chime Enterprise | Entrepreneur, Businessman |
| 3rd row | Chief of Chime Enterprise | Entrepreneur, Businessman |
| 4th row | Chief of Chime Enterprise | Entrepreneur, Businessman |
| 5th row | Chief of Chime Enterprise | Entrepreneur, Businessman |
| Value | Count | Frequency (%) |
| 309737 | 9.7% | |
| at | 118357 | 3.7% |
| of | 108636 | 3.4% |
| and | 73114 | 2.3% |
| professor | 44426 | 1.4% |
| the | 33865 | 1.1% |
| director | 26167 | 0.8% |
| president | 25099 | 0.8% |
| university | 24357 | 0.8% |
| global | 21255 | 0.7% |
| Other values (6980) | 2395393 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2940729 | 13.1% | |
| e | 1910629 | 8.5% |
| i | 1433952 | 6.4% |
| r | 1414287 | 6.3% |
| a | 1366682 | 6.1% |
| t | 1363986 | 6.1% |
| n | 1316579 | 5.9% |
| o | 1212429 | 5.4% |
| s | 923066 | 4.1% |
| l | 650294 | 2.9% |
| Other values (151) | 7900545 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22433178 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2940729 | 13.1% | |
| e | 1910629 | 8.5% |
| i | 1433952 | 6.4% |
| r | 1414287 | 6.3% |
| a | 1366682 | 6.1% |
| t | 1363986 | 6.1% |
| n | 1316579 | 5.9% |
| o | 1212429 | 5.4% |
| s | 923066 | 4.1% |
| l | 650294 | 2.9% |
| Other values (151) | 7900545 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22433178 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2940729 | 13.1% | |
| e | 1910629 | 8.5% |
| i | 1433952 | 6.4% |
| r | 1414287 | 6.3% |
| a | 1366682 | 6.1% |
| t | 1363986 | 6.1% |
| n | 1316579 | 5.9% |
| o | 1212429 | 5.4% |
| s | 923066 | 4.1% |
| l | 650294 | 2.9% |
| Other values (151) | 7900545 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22433178 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2940729 | 13.1% | |
| e | 1910629 | 8.5% |
| i | 1433952 | 6.4% |
| r | 1414287 | 6.3% |
| a | 1366682 | 6.1% |
| t | 1363986 | 6.1% |
| n | 1316579 | 5.9% |
| o | 1212429 | 5.4% |
| s | 923066 | 4.1% |
| l | 650294 | 2.9% |
| Other values (151) | 7900545 |
USER_LOCATION
Text
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 33 |
| Mean length | 32.53649194 |
| Min length | 12 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York, New York, United States |
|---|---|
| 2nd row | New York, New York, United States |
| 3rd row | New York, New York, United States |
| 4th row | New York, New York, United States |
| 5th row | New York, New York, United States |
| Value | Count | Frequency (%) |
| new | 448002 | |
| york | 448002 | |
| united | 192085 | |
| states | 192085 | |
| city | 98054 | 6.4% |
| metropolitan | 97980 | 6.4% |
| area | 60957 | 4.0% |
| ny | 92 | < 0.1% |
| greater | 70 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1264253 | ||
| e | 991249 | |
| t | 870339 | 9.8% |
| o | 643962 | 7.2% |
| r | 607079 | 6.8% |
| N | 448094 | 5.0% |
| Y | 448094 | 5.0% |
| k | 448002 | 5.0% |
| w | 448002 | 5.0% |
| i | 388119 | 4.4% |
| Other values (14) | 2327677 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8884870 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1264253 | ||
| e | 991249 | |
| t | 870339 | 9.8% |
| o | 643962 | 7.2% |
| r | 607079 | 6.8% |
| N | 448094 | 5.0% |
| Y | 448094 | 5.0% |
| k | 448002 | 5.0% |
| w | 448002 | 5.0% |
| i | 388119 | 4.4% |
| Other values (14) | 2327677 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8884870 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1264253 | ||
| e | 991249 | |
| t | 870339 | 9.8% |
| o | 643962 | 7.2% |
| r | 607079 | 6.8% |
| N | 448094 | 5.0% |
| Y | 448094 | 5.0% |
| k | 448002 | 5.0% |
| w | 448002 | 5.0% |
| i | 388119 | 4.4% |
| Other values (14) | 2327677 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8884870 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1264253 | ||
| e | 991249 | |
| t | 870339 | 9.8% |
| o | 643962 | 7.2% |
| r | 607079 | 6.8% |
| N | 448094 | 5.0% |
| Y | 448094 | 5.0% |
| k | 448002 | 5.0% |
| w | 448002 | 5.0% |
| i | 388119 | 4.4% |
| Other values (14) | 2327677 |
SUMMARY
Text
Missing
| Distinct | 3019 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 26668 |
| Missing (%) | 9.8% |
| Memory size | 2.1 MiB |
Length
| Max length | 3033 |
|---|---|
| Median length | 2510 |
| Mean length | 1336.236285 |
| Min length | 1 |
Unique
| Unique | 1088 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | As the Chief of Chime Enterprise, I lead a high-growth business unit delivering next-generation financial wellness solutions to many of the nationâs largest employers and their workforces. My work is grounded in a simple but urgent belief: the current financial system is fragmented and inequitableâand itâs not built for the way people earn and live today. At Chime, weâre building solutions that give workers better access to their money and a stronger path to long-term financial well-being. For employers, that means deeper engagement, better retention, and healthier teams overall. Before Chime, I spent nearly two decades advising and building solutions for some of the worldâs most powerful financial institutions, then went on to found and scale several fintech companies designed to close structural gaps in the system. Iâve seen how broken the infrastructure isâand Iâm committed to reshaping it so it works for everyone. More on my work and what drives it: https://jason-lee.co |
|---|---|
| 2nd row | As the Chief of Chime Enterprise, I lead a high-growth business unit delivering next-generation financial wellness solutions to many of the nationâs largest employers and their workforces. My work is grounded in a simple but urgent belief: the current financial system is fragmented and inequitableâand itâs not built for the way people earn and live today. At Chime, weâre building solutions that give workers better access to their money and a stronger path to long-term financial well-being. For employers, that means deeper engagement, better retention, and healthier teams overall. Before Chime, I spent nearly two decades advising and building solutions for some of the worldâs most powerful financial institutions, then went on to found and scale several fintech companies designed to close structural gaps in the system. Iâve seen how broken the infrastructure isâand Iâm committed to reshaping it so it works for everyone. More on my work and what drives it: https://jason-lee.co |
| 3rd row | As the Chief of Chime Enterprise, I lead a high-growth business unit delivering next-generation financial wellness solutions to many of the nationâs largest employers and their workforces. My work is grounded in a simple but urgent belief: the current financial system is fragmented and inequitableâand itâs not built for the way people earn and live today. At Chime, weâre building solutions that give workers better access to their money and a stronger path to long-term financial well-being. For employers, that means deeper engagement, better retention, and healthier teams overall. Before Chime, I spent nearly two decades advising and building solutions for some of the worldâs most powerful financial institutions, then went on to found and scale several fintech companies designed to close structural gaps in the system. Iâve seen how broken the infrastructure isâand Iâm committed to reshaping it so it works for everyone. More on my work and what drives it: https://jason-lee.co |
| 4th row | As the Chief of Chime Enterprise, I lead a high-growth business unit delivering next-generation financial wellness solutions to many of the nationâs largest employers and their workforces. My work is grounded in a simple but urgent belief: the current financial system is fragmented and inequitableâand itâs not built for the way people earn and live today. At Chime, weâre building solutions that give workers better access to their money and a stronger path to long-term financial well-being. For employers, that means deeper engagement, better retention, and healthier teams overall. Before Chime, I spent nearly two decades advising and building solutions for some of the worldâs most powerful financial institutions, then went on to found and scale several fintech companies designed to close structural gaps in the system. Iâve seen how broken the infrastructure isâand Iâm committed to reshaping it so it works for everyone. More on my work and what drives it: https://jason-lee.co |
| 5th row | As the Chief of Chime Enterprise, I lead a high-growth business unit delivering next-generation financial wellness solutions to many of the nationâs largest employers and their workforces. My work is grounded in a simple but urgent belief: the current financial system is fragmented and inequitableâand itâs not built for the way people earn and live today. At Chime, weâre building solutions that give workers better access to their money and a stronger path to long-term financial well-being. For employers, that means deeper engagement, better retention, and healthier teams overall. Before Chime, I spent nearly two decades advising and building solutions for some of the worldâs most powerful financial institutions, then went on to found and scale several fintech companies designed to close structural gaps in the system. Iâve seen how broken the infrastructure isâand Iâm committed to reshaping it so it works for everyone. More on my work and what drives it: https://jason-lee.co |
| Value | Count | Frequency (%) |
| and | 2624361 | 5.5% |
| the | 1618277 | 3.4% |
| of | 1343333 | 2.8% |
| in | 1147571 | 2.4% |
| a | 835440 | 1.8% |
| to | 745744 | 1.6% |
| i | 561212 | 1.2% |
| for | 517776 | 1.1% |
| as | 477944 | 1.0% |
| at | 468517 | 1.0% |
| Other values (28417) | 37197486 |
Most occurring characters
| Value | Count | Frequency (%) |
| 47369024 | ||
| e | 29360204 | 8.9% |
| a | 22069197 | 6.7% |
| i | 21296342 | 6.5% |
| n | 21070501 | 6.4% |
| t | 19469347 | 5.9% |
| o | 17817261 | 5.4% |
| r | 16987535 | 5.2% |
| s | 16032346 | 4.9% |
| l | 10749794 | 3.3% |
| Other values (173) | 107035087 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 329256638 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 47369024 | ||
| e | 29360204 | 8.9% |
| a | 22069197 | 6.7% |
| i | 21296342 | 6.5% |
| n | 21070501 | 6.4% |
| t | 19469347 | 5.9% |
| o | 17817261 | 5.4% |
| r | 16987535 | 5.2% |
| s | 16032346 | 4.9% |
| l | 10749794 | 3.3% |
| Other values (173) | 107035087 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 329256638 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 47369024 | ||
| e | 29360204 | 8.9% |
| a | 22069197 | 6.7% |
| i | 21296342 | 6.5% |
| n | 21070501 | 6.4% |
| t | 19469347 | 5.9% |
| o | 17817261 | 5.4% |
| r | 16987535 | 5.2% |
| s | 16032346 | 4.9% |
| l | 10749794 | 3.3% |
| Other values (173) | 107035087 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 329256638 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 47369024 | ||
| e | 29360204 | 8.9% |
| a | 22069197 | 6.7% |
| i | 21296342 | 6.5% |
| n | 21070501 | 6.4% |
| t | 19469347 | 5.9% |
| o | 17817261 | 5.4% |
| r | 16987535 | 5.2% |
| s | 16032346 | 4.9% |
| l | 10749794 | 3.3% |
| Other values (173) | 107035087 |
NUMCONNECTIONS
Real number (ℝ)
| Distinct | 345 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 465.8383478 |
| Minimum | 151 |
|---|---|
| Maximum | 500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 151 |
|---|---|
| 5-th percentile | 262 |
| Q1 | 500 |
| median | 500 |
| Q3 | 500 |
| 95-th percentile | 500 |
| Maximum | 500 |
| Range | 349 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 80.29544461 |
|---|---|
| Coefficient of variation (CV) | 0.1723676142 |
| Kurtosis | 5.060259262 |
| Mean | 465.8383478 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -2.456852149 |
| Sum | 127208341 |
| Variance | 6447.358425 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 215124 | |
| 403 | 3235 | 1.2% |
| 321 | 3123 | 1.1% |
| 292 | 2330 | 0.9% |
| 168 | 1603 | 0.6% |
| 456 | 1563 | 0.6% |
| 494 | 1438 | 0.5% |
| 394 | 1238 | 0.5% |
| 188 | 1211 | 0.4% |
| 464 | 1197 | 0.4% |
| Other values (335) | 41012 | 15.0% |
| Value | Count | Frequency (%) |
| 151 | 22 | < 0.1% |
| 152 | 1186 | |
| 153 | 4 | < 0.1% |
| 154 | 139 | 0.1% |
| 155 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 500 | 215124 | |
| 499 | 7 | < 0.1% |
| 497 | 17 | < 0.1% |
| 495 | 2 | < 0.1% |
| 494 | 1438 | 0.5% |
F_PROB
Real number (ℝ)
Zeros
| Distinct | 1508 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4518314847 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 26125 |
| Zeros (%) | 9.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.004226263147 |
| median | 0.4040863216 |
| Q3 | 0.9969102144 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.9926839513 |
Descriptive statistics
| Standard deviation | 0.4488875153 |
|---|---|
| Coefficient of variation (CV) | 0.9934843642 |
| Kurtosis | -1.775321881 |
| Mean | 0.4518314847 |
| Median Absolute Deviation (MAD) | 0.4007193241 |
| Skewness | 0.1997508129 |
| Sum | 123383.4309 |
| Variance | 0.2015000014 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 27626 | 10.1% |
| 0 | 26125 | 9.6% |
| 0.5105789304 | 10759 | 3.9% |
| 0.003713601967 | 6329 | 2.3% |
| 0.9971190095 | 5810 | 2.1% |
| 0.5311644077 | 5109 | 1.9% |
| 0.004277224652 | 5006 | 1.8% |
| 0.005615275353 | 4473 | 1.6% |
| 0.004226263147 | 4294 | 1.6% |
| 0.4040863216 | 3750 | 1.4% |
| Other values (1498) | 173793 |
| Value | Count | Frequency (%) |
| 0 | 26125 | |
| 0.0002121970901 | 378 | 0.1% |
| 0.000220701826 | 2 | < 0.1% |
| 0.0002225783537 | 2 | < 0.1% |
| 0.0002333488956 | 221 | 0.1% |
| Value | Count | Frequency (%) |
| 1 | 27626 | |
| 0.9998624921 | 216 | 0.1% |
| 0.9998415709 | 1 | < 0.1% |
| 0.9997996688 | 1 | < 0.1% |
| 0.9997956753 | 1 | < 0.1% |
M_PROB
Real number (ℝ)
Zeros
| Distinct | 1508 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5481685133 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 27626 |
| Zeros (%) | 10.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.003089785576 |
| median | 0.5959136486 |
| Q3 | 0.9957737327 |
| 95-th percentile | 1 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.9926839471 |
Descriptive statistics
| Standard deviation | 0.4488875156 |
|---|---|
| Coefficient of variation (CV) | 0.8188859898 |
| Kurtosis | -1.775321887 |
| Mean | 0.5481685133 |
| Median Absolute Deviation (MAD) | 0.4007193446 |
| Skewness | -0.199750799 |
| Sum | 149690.5686 |
| Variance | 0.2015000017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 27626 | 10.1% |
| 1 | 26125 | 9.6% |
| 0.4894210398 | 10759 | 3.9% |
| 0.9962863922 | 6329 | 2.3% |
| 0.002880990505 | 5810 | 2.1% |
| 0.4688355625 | 5109 | 1.9% |
| 0.9957227707 | 5006 | 1.8% |
| 0.994384706 | 4473 | 1.6% |
| 0.9957737327 | 4294 | 1.6% |
| 0.5959136486 | 3750 | 1.4% |
| Other values (1498) | 173793 |
| Value | Count | Frequency (%) |
| 0 | 27626 | |
| 0.0001375079155 | 216 | 0.1% |
| 0.0001584291458 | 1 | < 0.1% |
| 0.0002003312111 | 1 | < 0.1% |
| 0.0002043247223 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 26125 | |
| 0.9997878075 | 378 | 0.1% |
| 0.999779284 | 2 | < 0.1% |
| 0.9997774363 | 2 | < 0.1% |
| 0.9997666478 | 221 | 0.1% |
WHITE_PROB
Real number (ℝ)
Zeros
| Distinct | 865 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5811596207 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 6465 |
| Zeros (%) | 2.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.002000000095 |
| Q1 | 0.1120000035 |
| median | 0.7409999967 |
| Q3 | 0.9629999995 |
| 95-th percentile | 0.9909999967 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.850999996 |
Descriptive statistics
| Standard deviation | 0.3945790435 |
|---|---|
| Coefficient of variation (CV) | 0.6789512372 |
| Kurtosis | -1.532550894 |
| Mean | 0.5811596207 |
| Median Absolute Deviation (MAD) | 0.2440000176 |
| Skewness | -0.4153614864 |
| Sum | 158699.5823 |
| Variance | 0.1556926216 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.003000000026 | 12626 | 4.6% |
| 0.002000000095 | 9662 | 3.5% |
| 0.7900000215 | 7958 | 2.9% |
| 0.9819999933 | 7775 | 2.8% |
| 0 | 6465 | 2.4% |
| 0.9629999995 | 5860 | 2.1% |
| 0.2049999982 | 5263 | 1.9% |
| 0.9869999886 | 5001 | 1.8% |
| 0.9840000272 | 4061 | 1.5% |
| 0.7409999967 | 3914 | 1.4% |
| Other values (855) | 204489 |
| Value | Count | Frequency (%) |
| 0 | 6465 | |
| 0.001000000047 | 2989 | 1.1% |
| 0.002000000095 | 9662 | |
| 0.003000000026 | 12626 | |
| 0.00400000019 | 2513 | 0.9% |
| Value | Count | Frequency (%) |
| 1 | 1644 | |
| 0.9990000129 | 1208 | |
| 0.9980000257 | 1914 | |
| 0.996999979 | 652 | 0.2% |
| 0.9959999919 | 1881 |
BLACK_PROB
Real number (ℝ)
Zeros
| Distinct | 601 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.09922306414 |
| Minimum | 0 |
|---|---|
| Maximum | 0.9980000257 |
| Zeros | 40611 |
| Zeros (%) | 14.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.001000000047 |
| median | 0.0120000001 |
| Q3 | 0.1089999974 |
| 95-th percentile | 0.5519999862 |
| Maximum | 0.9980000257 |
| Range | 0.9980000257 |
| Interquartile range (IQR) | 0.1079999974 |
Descriptive statistics
| Standard deviation | 0.1918493846 |
|---|---|
| Coefficient of variation (CV) | 1.933516025 |
| Kurtosis | 7.488700917 |
| Mean | 0.09922306414 |
| Median Absolute Deviation (MAD) | 0.0120000001 |
| Skewness | 2.753265525 |
| Sum | 27095.23902 |
| Variance | 0.03680618637 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.001000000047 | 42620 | 15.6% |
| 0 | 40611 | 14.9% |
| 0.008999999613 | 12993 | 4.8% |
| 0.003000000026 | 10698 | 3.9% |
| 0.1089999974 | 7939 | 2.9% |
| 0.002000000095 | 6361 | 2.3% |
| 0.01400000043 | 5547 | 2.0% |
| 0.25 | 5527 | 2.0% |
| 0.006000000052 | 5394 | 2.0% |
| 0.3300000131 | 4632 | 1.7% |
| Other values (591) | 130752 |
| Value | Count | Frequency (%) |
| 0 | 40611 | |
| 0.001000000047 | 42620 | |
| 0.002000000095 | 6361 | 2.3% |
| 0.003000000026 | 10698 | 3.9% |
| 0.00400000019 | 4031 | 1.5% |
| Value | Count | Frequency (%) |
| 0.9980000257 | 13 | < 0.1% |
| 0.9959999919 | 1 | < 0.1% |
| 0.9950000048 | 2 | < 0.1% |
| 0.9909999967 | 21 | < 0.1% |
| 0.9900000095 | 55 |
API_PROB
Real number (ℝ)
Zeros
| Distinct | 581 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1673660292 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 26816 |
| Zeros (%) | 9.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.001000000047 |
| median | 0.008999999613 |
| Q3 | 0.06599999964 |
| 95-th percentile | 0.9869999886 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.06499999959 |
Descriptive statistics
| Standard deviation | 0.3237927953 |
|---|---|
| Coefficient of variation (CV) | 1.934638689 |
| Kurtosis | 1.706700673 |
| Mean | 0.1673660292 |
| Median Absolute Deviation (MAD) | 0.008999999613 |
| Skewness | 1.839422567 |
| Sum | 45703.31105 |
| Variance | 0.1048417743 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.001000000047 | 52906 | |
| 0 | 26816 | 9.8% |
| 0.003000000026 | 19726 | 7.2% |
| 0.002000000095 | 16317 | 6.0% |
| 0.01899999939 | 9689 | 3.5% |
| 0.01499999966 | 9258 | 3.4% |
| 0.01999999955 | 6570 | 2.4% |
| 1 | 5888 | 2.2% |
| 0.9869999886 | 5503 | 2.0% |
| 0.004999999888 | 5347 | 2.0% |
| Other values (571) | 115054 |
| Value | Count | Frequency (%) |
| 0 | 26816 | |
| 0.001000000047 | 52906 | |
| 0.002000000095 | 16317 | 6.0% |
| 0.003000000026 | 19726 | 7.2% |
| 0.00400000019 | 5191 | 1.9% |
| Value | Count | Frequency (%) |
| 1 | 5888 | |
| 0.9990000129 | 206 | 0.1% |
| 0.9980000257 | 1443 | 0.5% |
| 0.996999979 | 387 | 0.1% |
| 0.9959999919 | 409 | 0.1% |
HISPANIC_PROB
Real number (ℝ)
Zeros
| Distinct | 591 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.12772503 |
| Minimum | 0 |
|---|---|
| Maximum | 1 |
| Zeros | 30085 |
| Zeros (%) | 11.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.002000000095 |
| median | 0.0120000001 |
| Q3 | 0.08299999684 |
| 95-th percentile | 0.9079999924 |
| Maximum | 1 |
| Range | 1 |
| Interquartile range (IQR) | 0.08099999675 |
Descriptive statistics
| Standard deviation | 0.2616267557 |
|---|---|
| Coefficient of variation (CV) | 2.048359321 |
| Kurtosis | 4.490410381 |
| Mean | 0.12772503 |
| Median Absolute Deviation (MAD) | 0.0120000001 |
| Skewness | 2.408271915 |
| Sum | 34878.38485 |
| Variance | 0.06844855931 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.001000000047 | 31087 | 11.4% |
| 0 | 30085 | 11.0% |
| 0.003000000026 | 15471 | 5.7% |
| 0.002000000095 | 14573 | 5.3% |
| 0.01099999994 | 12264 | 4.5% |
| 0.9819999933 | 10761 | 3.9% |
| 0.07699999958 | 7936 | 2.9% |
| 0.00400000019 | 7317 | 2.7% |
| 0.004999999888 | 5574 | 2.0% |
| 0.01999999955 | 4902 | 1.8% |
| Other values (581) | 133104 |
| Value | Count | Frequency (%) |
| 0 | 30085 | |
| 0.001000000047 | 31087 | |
| 0.002000000095 | 14573 | |
| 0.003000000026 | 15471 | |
| 0.00400000019 | 7317 | 2.7% |
| Value | Count | Frequency (%) |
| 1 | 125 | |
| 0.9990000129 | 150 | |
| 0.9980000257 | 311 | |
| 0.996999979 | 16 | < 0.1% |
| 0.9959999919 | 196 |
NATIVE_PROB
Real number (ℝ)
Zeros
| Distinct | 68 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.002862703912 |
| Minimum | 0 |
|---|---|
| Maximum | 0.1860000044 |
| Zeros | 97216 |
| Zeros (%) | 35.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.001000000047 |
| Q3 | 0.003000000026 |
| 95-th percentile | 0.01099999994 |
| Maximum | 0.1860000044 |
| Range | 0.1860000044 |
| Interquartile range (IQR) | 0.003000000026 |
Descriptive statistics
| Standard deviation | 0.006095161886 |
|---|---|
| Coefficient of variation (CV) | 2.129162524 |
| Kurtosis | 127.418397 |
| Mean | 0.002862703912 |
| Median Absolute Deviation (MAD) | 0.001000000047 |
| Skewness | 8.884767455 |
| Sum | 781.730008 |
| Variance | 3.715099841 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 97216 | |
| 0.001000000047 | 53288 | |
| 0.002000000095 | 36976 | 13.5% |
| 0.003000000026 | 21369 | 7.8% |
| 0.004999999888 | 19152 | 7.0% |
| 0.00400000019 | 11316 | 4.1% |
| 0.01099999994 | 10956 | 4.0% |
| 0.006000000052 | 6068 | 2.2% |
| 0.007000000216 | 3125 | 1.1% |
| 0.0120000001 | 2479 | 0.9% |
| Other values (58) | 11129 | 4.1% |
| Value | Count | Frequency (%) |
| 0 | 97216 | |
| 0.001000000047 | 53288 | |
| 0.002000000095 | 36976 | 13.5% |
| 0.003000000026 | 21369 | 7.8% |
| 0.00400000019 | 11316 | 4.1% |
| Value | Count | Frequency (%) |
| 0.1860000044 | 2 | < 0.1% |
| 0.1299999952 | 1 | < 0.1% |
| 0.126000002 | 96 | |
| 0.125 | 6 | < 0.1% |
| 0.1070000008 | 1 | < 0.1% |
MULTIPLE_PROB
Real number (ℝ)
Zeros
| Distinct | 224 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02166355277 |
| Minimum | 0 |
|---|---|
| Maximum | 0.7269999981 |
| Zeros | 59816 |
| Zeros (%) | 21.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.001000000047 |
| median | 0.009999999776 |
| Q3 | 0.03099999949 |
| 95-th percentile | 0.06700000167 |
| Maximum | 0.7269999981 |
| Range | 0.7269999981 |
| Interquartile range (IQR) | 0.02999999944 |
Descriptive statistics
| Standard deviation | 0.03802145014 |
|---|---|
| Coefficient of variation (CV) | 1.755088398 |
| Kurtosis | 48.01212236 |
| Mean | 0.02166355277 |
| Median Absolute Deviation (MAD) | 0.009999999776 |
| Skewness | 5.679029795 |
| Sum | 5915.75301 |
| Variance | 0.00144563067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 59816 | |
| 0.001000000047 | 17014 | 6.2% |
| 0.00400000019 | 15442 | 5.7% |
| 0.002000000095 | 9708 | 3.6% |
| 0.01400000043 | 9698 | 3.6% |
| 0.00800000038 | 9162 | 3.4% |
| 0.006000000052 | 8236 | 3.0% |
| 0.01099999994 | 6456 | 2.4% |
| 0.04399999976 | 6138 | 2.2% |
| 0.03099999949 | 5650 | 2.1% |
| Other values (214) | 125754 |
| Value | Count | Frequency (%) |
| 0 | 59816 | |
| 0.001000000047 | 17014 | 6.2% |
| 0.002000000095 | 9708 | 3.6% |
| 0.003000000026 | 4024 | 1.5% |
| 0.00400000019 | 15442 | 5.7% |
| Value | Count | Frequency (%) |
| 0.7269999981 | 1 | < 0.1% |
| 0.6320000291 | 1 | < 0.1% |
| 0.6119999886 | 1 | < 0.1% |
| 0.5619999766 | 1 | < 0.1% |
| 0.5210000277 | 7 |
PRESTIGE
Real number (ℝ)
| Distinct | 4622 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4890997578 |
| Minimum | -0.4861144423 |
|---|---|
| Maximum | 1.221478224 |
| Zeros | 452 |
| Zeros (%) | 0.2% |
| Negative | 46553 |
| Negative (%) | 17.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | -0.4861144423 |
|---|---|
| 5-th percentile | -0.240483731 |
| Q1 | 0.2873032093 |
| median | 0.5957869291 |
| Q3 | 0.7953392267 |
| 95-th percentile | 0.9192367792 |
| Maximum | 1.221478224 |
| Range | 1.707592666 |
| Interquartile range (IQR) | 0.5080360174 |
Descriptive statistics
| Standard deviation | 0.3723869814 |
|---|---|
| Coefficient of variation (CV) | 0.7613722466 |
| Kurtosis | -0.4988840956 |
| Mean | 0.4890997578 |
| Median Absolute Deviation (MAD) | 0.2239435911 |
| Skewness | -0.8215096006 |
| Sum | 133560.4273 |
| Variance | 0.1386720639 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6412850618 | 10759 | 3.9% |
| 0.5805336833 | 6300 | 2.3% |
| 0.9301841259 | 5109 | 1.9% |
| 0.5957869291 | 4914 | 1.8% |
| 0.6592636108 | 3864 | 1.4% |
| 0.8960794806 | 3763 | 1.4% |
| 0.2517215908 | 3750 | 1.4% |
| 0.6492943764 | 3340 | 1.2% |
| -0.3139176369 | 3232 | 1.2% |
| 0.845836699 | 3120 | 1.1% |
| Other values (4612) | 224923 |
| Value | Count | Frequency (%) |
| -0.4861144423 | 1 | |
| -0.41669783 | 1 | |
| -0.4152959585 | 1 | |
| -0.4147300124 | 1 | |
| -0.4056269825 | 1 |
| Value | Count | Frequency (%) |
| 1.221478224 | 1 | < 0.1% |
| 1.219937801 | 50 | |
| 1.164294004 | 1 | < 0.1% |
| 1.149400234 | 88 | |
| 1.131468177 | 1 | < 0.1% |
HIGHEST_DEGREE
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 6 |
| Mean length | 6.346371313 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | empty |
|---|---|
| 2nd row | empty |
| 3rd row | empty |
| 4th row | empty |
| 5th row | empty |
| Value | Count | Frequency (%) |
| doctor | 100998 | |
| master | 72071 | |
| bachelor | 69160 | |
| mba | 15317 | 5.5% |
| empty | 12547 | 4.5% |
| high | 2910 | 1.1% |
| school | 2910 | 1.1% |
| associate | 71 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 277047 | |
| r | 242229 | |
| t | 185687 | |
| c | 173139 | |
| e | 153849 | |
| a | 141302 | |
| D | 100998 | 5.8% |
| M | 87388 | 5.0% |
| B | 84477 | 4.9% |
| h | 74980 | 4.3% |
| Other values (11) | 211933 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1733029 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 277047 | |
| r | 242229 | |
| t | 185687 | |
| c | 173139 | |
| e | 153849 | |
| a | 141302 | |
| D | 100998 | 5.8% |
| M | 87388 | 5.0% |
| B | 84477 | 4.9% |
| h | 74980 | 4.3% |
| Other values (11) | 211933 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1733029 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 277047 | |
| r | 242229 | |
| t | 185687 | |
| c | 173139 | |
| e | 153849 | |
| a | 141302 | |
| D | 100998 | 5.8% |
| M | 87388 | 5.0% |
| B | 84477 | 4.9% |
| h | 74980 | 4.3% |
| Other values (11) | 211933 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1733029 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 277047 | |
| r | 242229 | |
| t | 185687 | |
| c | 173139 | |
| e | 153849 | |
| a | 141302 | |
| D | 100998 | 5.8% |
| M | 87388 | 5.0% |
| B | 84477 | 4.9% |
| h | 74980 | 4.3% |
| Other values (11) | 211933 |
EXPERTISE
Text
Missing
| Distinct | 2505 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 95382 |
| Missing (%) | 34.9% |
| Memory size | 2.1 MiB |
Length
| Max length | 1882 |
|---|---|
| Median length | 859 |
| Mean length | 462.7336008 |
| Min length | 4 |
Unique
| Unique | 1181 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | autocad release 12,autocad release 2000,paradox,microsoft powerpoint,autocad accurender,windows xp,windows,spanish,microsoft outlook,press releases,windows 7,windows vista,adobe photoshop,premise 4.0,microsoft word,facebook,microsoft publisher,microsoft sharepoint,lotus notes,news editpro,mac osx,telnet,customer relations,microsoft excel,remedy,event planning,jet,copy editing,pine,microsoft exchange,windows 2000,microsoft access,file maker pro,operating systems,autocad release 14,writing,wordperfect,autocad release 13,jda systems,adobe acrobat,clarisworks,autocad release 10,adobe reader 9,adobe illustrator,proofreading,unix,language skills,cintra cts,autocad release 11,autocad anderson windows |
|---|---|
| 2nd row | autocad release 12,autocad release 2000,paradox,microsoft powerpoint,autocad accurender,windows xp,windows,spanish,microsoft outlook,press releases,windows 7,windows vista,adobe photoshop,premise 4.0,microsoft word,facebook,microsoft publisher,microsoft sharepoint,lotus notes,news editpro,mac osx,telnet,customer relations,microsoft excel,remedy,event planning,jet,copy editing,pine,microsoft exchange,windows 2000,microsoft access,file maker pro,operating systems,autocad release 14,writing,wordperfect,autocad release 13,jda systems,adobe acrobat,clarisworks,autocad release 10,adobe reader 9,adobe illustrator,proofreading,unix,language skills,cintra cts,autocad release 11,autocad anderson windows |
| 3rd row | autocad release 12,autocad release 2000,paradox,microsoft powerpoint,autocad accurender,windows xp,windows,spanish,microsoft outlook,press releases,windows 7,windows vista,adobe photoshop,premise 4.0,microsoft word,facebook,microsoft publisher,microsoft sharepoint,lotus notes,news editpro,mac osx,telnet,customer relations,microsoft excel,remedy,event planning,jet,copy editing,pine,microsoft exchange,windows 2000,microsoft access,file maker pro,operating systems,autocad release 14,writing,wordperfect,autocad release 13,jda systems,adobe acrobat,clarisworks,autocad release 10,adobe reader 9,adobe illustrator,proofreading,unix,language skills,cintra cts,autocad release 11,autocad anderson windows |
| 4th row | autocad release 12,autocad release 2000,paradox,microsoft powerpoint,autocad accurender,windows xp,windows,spanish,microsoft outlook,press releases,windows 7,windows vista,adobe photoshop,premise 4.0,microsoft word,facebook,microsoft publisher,microsoft sharepoint,lotus notes,news editpro,mac osx,telnet,customer relations,microsoft excel,remedy,event planning,jet,copy editing,pine,microsoft exchange,windows 2000,microsoft access,file maker pro,operating systems,autocad release 14,writing,wordperfect,autocad release 13,jda systems,adobe acrobat,clarisworks,autocad release 10,adobe reader 9,adobe illustrator,proofreading,unix,language skills,cintra cts,autocad release 11,autocad anderson windows |
| 5th row | autocad release 12,autocad release 2000,paradox,microsoft powerpoint,autocad accurender,windows xp,windows,spanish,microsoft outlook,press releases,windows 7,windows vista,adobe photoshop,premise 4.0,microsoft word,facebook,microsoft publisher,microsoft sharepoint,lotus notes,news editpro,mac osx,telnet,customer relations,microsoft excel,remedy,event planning,jet,copy editing,pine,microsoft exchange,windows 2000,microsoft access,file maker pro,operating systems,autocad release 14,writing,wordperfect,autocad release 13,jda systems,adobe acrobat,clarisworks,autocad release 10,adobe reader 9,adobe illustrator,proofreading,unix,language skills,cintra cts,autocad release 11,autocad anderson windows |
| Value | Count | Frequency (%) |
| release | 38030 | 0.9% |
| media | 28695 | 0.7% |
| 24348 | 0.6% | |
| team | 19841 | 0.5% |
| management | 19167 | 0.5% |
| business | 18306 | 0.4% |
| project | 16115 | 0.4% |
| creative | 13245 | 0.3% |
| and | 11423 | 0.3% |
| management,team | 11302 | 0.3% |
| Other values (27412) | 3913711 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 8008557 | 9.7% |
| i | 6991180 | 8.5% |
| a | 6342387 | 7.7% |
| n | 6051576 | 7.4% |
| t | 5793013 | 7.0% |
| , | 5407046 | 6.6% |
| r | 4852987 | 5.9% |
| s | 4644111 | 5.6% |
| o | 4454135 | 5.4% |
| 3934655 | 4.8% | |
| Other values (106) | 25744412 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 82224059 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 8008557 | 9.7% |
| i | 6991180 | 8.5% |
| a | 6342387 | 7.7% |
| n | 6051576 | 7.4% |
| t | 5793013 | 7.0% |
| , | 5407046 | 6.6% |
| r | 4852987 | 5.9% |
| s | 4644111 | 5.6% |
| o | 4454135 | 5.4% |
| 3934655 | 4.8% | |
| Other values (106) | 25744412 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 82224059 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 8008557 | 9.7% |
| i | 6991180 | 8.5% |
| a | 6342387 | 7.7% |
| n | 6051576 | 7.4% |
| t | 5793013 | 7.0% |
| , | 5407046 | 6.6% |
| r | 4852987 | 5.9% |
| s | 4644111 | 5.6% |
| o | 4454135 | 5.4% |
| 3934655 | 4.8% | |
| Other values (106) | 25744412 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 82224059 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 8008557 | 9.7% |
| i | 6991180 | 8.5% |
| a | 6342387 | 7.7% |
| n | 6051576 | 7.4% |
| t | 5793013 | 7.0% |
| , | 5407046 | 6.6% |
| r | 4852987 | 5.9% |
| s | 4644111 | 5.6% |
| o | 4454135 | 5.4% |
| 3934655 | 4.8% | |
| Other values (106) | 25744412 |
CURRENTINDUSTRY
Text
Missing
| Distinct | 215 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 3625 |
| Missing (%) | 1.3% |
| Memory size | 2.1 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 43 |
| Mean length | 18.17111958 |
| Min length | 5 |
Unique
| Unique | 19 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Computer Software |
|---|---|
| 2nd row | Computer Software |
| 3rd row | Computer Software |
| 4th row | Computer Software |
| 5th row | Computer Software |
| Value | Count | Frequency (%) |
| 52808 | 8.6% | |
| services | 35451 | 5.7% |
| health | 27360 | 4.4% |
| care | 26672 | 4.3% |
| hospital | 24293 | 3.9% |
| education | 23232 | 3.8% |
| practice | 21469 | 3.5% |
| higher | 21148 | 3.4% |
| law | 18899 | 3.1% |
| management | 17264 | 2.8% |
| Other values (234) | 348621 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 470769 | 9.6% |
| i | 442073 | 9.0% |
| a | 391151 | 8.0% |
| t | 379859 | 7.8% |
| n | 373521 | 7.6% |
| 347768 | 7.1% | |
| r | 298310 | 6.1% |
| o | 233012 | 4.8% |
| c | 221547 | 4.5% |
| s | 184157 | 3.8% |
| Other values (42) | 1554023 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4896190 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 470769 | 9.6% |
| i | 442073 | 9.0% |
| a | 391151 | 8.0% |
| t | 379859 | 7.8% |
| n | 373521 | 7.6% |
| 347768 | 7.1% | |
| r | 298310 | 6.1% |
| o | 233012 | 4.8% |
| c | 221547 | 4.5% |
| s | 184157 | 3.8% |
| Other values (42) | 1554023 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4896190 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 470769 | 9.6% |
| i | 442073 | 9.0% |
| a | 391151 | 8.0% |
| t | 379859 | 7.8% |
| n | 373521 | 7.6% |
| 347768 | 7.1% | |
| r | 298310 | 6.1% |
| o | 233012 | 4.8% |
| c | 221547 | 4.5% |
| s | 184157 | 3.8% |
| Other values (42) | 1554023 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4896190 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 470769 | 9.6% |
| i | 442073 | 9.0% |
| a | 391151 | 8.0% |
| t | 379859 | 7.8% |
| n | 373521 | 7.6% |
| 347768 | 7.1% | |
| r | 298310 | 6.1% |
| o | 233012 | 4.8% |
| c | 221547 | 4.5% |
| s | 184157 | 3.8% |
| Other values (42) | 1554023 |
IS_BAD_USER
Boolean
Imbalance
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 266.8 KiB |
| False | |
|---|---|
| True | 15 |
| Value | Count | Frequency (%) |
| False | 273059 | |
| True | 15 | < 0.1% |
UPDATED_DT
Text
| Distinct | 475 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Unique
| Unique | 198 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 2025-08-30 00:00:00.000 |
|---|---|
| 2nd row | 2025-08-30 00:00:00.000 |
| 3rd row | 2025-08-30 00:00:00.000 |
| 4th row | 2025-08-30 00:00:00.000 |
| 5th row | 2025-08-30 00:00:00.000 |
| Value | Count | Frequency (%) |
| 00:00:00.000 | 273074 | |
| 2025-08-29 | 40548 | 7.4% |
| 2025-08-30 | 30749 | 5.6% |
| 2025-08-28 | 13847 | 2.5% |
| 2025-08-31 | 12903 | 2.4% |
| 2025-09-01 | 11118 | 2.0% |
| 2025-08-26 | 10908 | 2.0% |
| 2025-07-07 | 6419 | 1.2% |
| 2025-05-05 | 6306 | 1.2% |
| 2025-06-23 | 6063 | 1.1% |
| Other values (466) | 134213 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3101724 | |
| 2 | 684309 | 10.9% |
| - | 546148 | 8.7% |
| : | 546148 | 8.7% |
| 5 | 299845 | 4.8% |
| 273074 | 4.3% | |
| . | 273074 | 4.3% |
| 8 | 189658 | 3.0% |
| 1 | 88675 | 1.4% |
| 3 | 81814 | 1.3% |
| Other values (4) | 196233 | 3.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6280702 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3101724 | |
| 2 | 684309 | 10.9% |
| - | 546148 | 8.7% |
| : | 546148 | 8.7% |
| 5 | 299845 | 4.8% |
| 273074 | 4.3% | |
| . | 273074 | 4.3% |
| 8 | 189658 | 3.0% |
| 1 | 88675 | 1.4% |
| 3 | 81814 | 1.3% |
| Other values (4) | 196233 | 3.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6280702 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3101724 | |
| 2 | 684309 | 10.9% |
| - | 546148 | 8.7% |
| : | 546148 | 8.7% |
| 5 | 299845 | 4.8% |
| 273074 | 4.3% | |
| . | 273074 | 4.3% |
| 8 | 189658 | 3.0% |
| 1 | 88675 | 1.4% |
| 3 | 81814 | 1.3% |
| Other values (4) | 196233 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6280702 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 3101724 | |
| 2 | 684309 | 10.9% |
| - | 546148 | 8.7% |
| : | 546148 | 8.7% |
| 5 | 299845 | 4.8% |
| 273074 | 4.3% | |
| . | 273074 | 4.3% |
| 8 | 189658 | 3.0% |
| 1 | 88675 | 1.4% |
| 3 | 81814 | 1.3% |
| Other values (4) | 196233 | 3.1% |
PUB_NAME
Text
Missing
| Distinct | 24238 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 8047 |
| Missing (%) | 2.9% |
| Memory size | 2.1 MiB |
Length
| Max length | 263 |
|---|---|
| Median length | 209 |
| Mean length | 68.05756772 |
| Min length | 1 |
Unique
| Unique | 4476 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | New Payday Options for Making Ends Meet |
|---|---|
| 2nd row | DailyPay CYCLE Feature |
| 3rd row | Help employees take control of their finances with DailyPay by giving them the financial flexibility they desire |
| 4th row | PAYTECH January 2021 : Page 12 |
| 5th row | SVUS Awards® Winners Announced in Annual 2020 Business Awards |
| Value | Count | Frequency (%) |
| the | 87588 | 3.4% |
| of | 81810 | 3.2% |
| and | 69777 | 2.7% |
| in | 64784 | 2.5% |
| a | 43779 | 1.7% |
| for | 42481 | 1.7% |
| to | 42100 | 1.6% |
| with | 21081 | 0.8% |
| 20173 | 0.8% | |
| on | 18400 | 0.7% |
| Other values (35753) | 2080462 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2312201 | 12.8% | |
| e | 1550043 | 8.6% |
| i | 1184131 | 6.6% |
| n | 1142688 | 6.3% |
| a | 1103517 | 6.1% |
| o | 1087105 | 6.0% |
| t | 1067083 | 5.9% |
| r | 928676 | 5.1% |
| s | 842308 | 4.7% |
| l | 578686 | 3.2% |
| Other values (175) | 6240655 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 18037093 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2312201 | 12.8% | |
| e | 1550043 | 8.6% |
| i | 1184131 | 6.6% |
| n | 1142688 | 6.3% |
| a | 1103517 | 6.1% |
| o | 1087105 | 6.0% |
| t | 1067083 | 5.9% |
| r | 928676 | 5.1% |
| s | 842308 | 4.7% |
| l | 578686 | 3.2% |
| Other values (175) | 6240655 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 18037093 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2312201 | 12.8% | |
| e | 1550043 | 8.6% |
| i | 1184131 | 6.6% |
| n | 1142688 | 6.3% |
| a | 1103517 | 6.1% |
| o | 1087105 | 6.0% |
| t | 1067083 | 5.9% |
| r | 928676 | 5.1% |
| s | 842308 | 4.7% |
| l | 578686 | 3.2% |
| Other values (175) | 6240655 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 18037093 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2312201 | 12.8% | |
| e | 1550043 | 8.6% |
| i | 1184131 | 6.6% |
| n | 1142688 | 6.3% |
| a | 1103517 | 6.1% |
| o | 1087105 | 6.0% |
| t | 1067083 | 5.9% |
| r | 928676 | 5.1% |
| s | 842308 | 4.7% |
| l | 578686 | 3.2% |
| Other values (175) | 6240655 |
PUB_DATE
Text
Missing
| Distinct | 4944 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 57936 |
| Missing (%) | 21.2% |
| Memory size | 2.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 363 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2016-07-04 |
|---|---|
| 2nd row | 2021-03-02 |
| 3rd row | 2021-02-19 |
| 4th row | 2021-01-01 |
| 5th row | 2020-10-29 |
| Value | Count | Frequency (%) |
| 2018-01-01 | 3562 | 1.7% |
| 2020-01-01 | 3518 | 1.6% |
| 2019-01-01 | 3109 | 1.4% |
| 2013-01-01 | 2799 | 1.3% |
| 2021-01-01 | 2668 | 1.2% |
| 2009-01-01 | 2552 | 1.2% |
| 2010-01-01 | 2392 | 1.1% |
| 2017-01-01 | 2301 | 1.1% |
| 2012-01-01 | 2172 | 1.0% |
| 2015-01-01 | 1979 | 0.9% |
| Other values (4934) | 188086 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 590128 | |
| - | 430276 | |
| 1 | 407152 | |
| 2 | 370448 | |
| 9 | 60513 | 2.8% |
| 3 | 58910 | 2.7% |
| 8 | 51674 | 2.4% |
| 6 | 47612 | 2.2% |
| 4 | 45907 | 2.1% |
| 5 | 44842 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2151380 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 590128 | |
| - | 430276 | |
| 1 | 407152 | |
| 2 | 370448 | |
| 9 | 60513 | 2.8% |
| 3 | 58910 | 2.7% |
| 8 | 51674 | 2.4% |
| 6 | 47612 | 2.2% |
| 4 | 45907 | 2.1% |
| 5 | 44842 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2151380 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 590128 | |
| - | 430276 | |
| 1 | 407152 | |
| 2 | 370448 | |
| 9 | 60513 | 2.8% |
| 3 | 58910 | 2.7% |
| 8 | 51674 | 2.4% |
| 6 | 47612 | 2.2% |
| 4 | 45907 | 2.1% |
| 5 | 44842 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2151380 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 590128 | |
| - | 430276 | |
| 1 | 407152 | |
| 2 | 370448 | |
| 9 | 60513 | 2.8% |
| 3 | 58910 | 2.7% |
| 8 | 51674 | 2.4% |
| 6 | 47612 | 2.2% |
| 4 | 45907 | 2.1% |
| 5 | 44842 | 2.1% |
PUB_DESCRIPTION
Text
Missing
| Distinct | 10707 |
|---|---|
| Distinct (%) | 8.7% |
| Missing | 149852 |
| Missing (%) | 54.9% |
| Memory size | 2.1 MiB |
Length
| Max length | 2048 |
|---|---|
| Median length | 1725 |
| Mean length | 356.6687686 |
| Min length | 3 |
Unique
| Unique | 2392 ? |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | This insider guide to Stony Brook University in Stony Brook, NY, features more than 160 pages of in-depth information, including student reviews, rankings across 20 campus life topics, and insider tips from students on campus. Written by a student at Stony Brook, this guidebook gives you the inside scoop on everything from academics and nightlife to housing and the meal plan. Read both the good and the bad and discover if Stony Brook is right for you. One of nearly 500 College Prowler guides, this Stony Brook guide features updated facts and figures along with the latest student reviews and insider tips from current students on campus. Find out what itâs like to be a student at Stony Brook and see if Stony Brook is the place for you. |
|---|---|
| 2nd row | Pascale discusses a âmake-or-breakâ moment in her career, and how her former career as an athlete has shaped her as a leader. |
| 3rd row | An Interview with Pascale Witz, Executive Vice President, Diabetes & Cardiovascular, Sanofi |
| 4th row | As one of Fortuneâs Most Powerful Women, Pascale regularly contributes to their website. In this post, Pascale gives her advice to first-time managers. |
| 5th row | Abstract: The neocortex contains excitatory neurons and inhibitory interneurons. Clones of neocortical excitatory neurons originating from the same progenitor cell are spatially organized and contribute to the formation of functional microcircuits. In contrast, relatively little is known about the production and organization of neocortical inhibitory interneurons. We found that neocortical inhibitory interneurons were produced as spatially organized clonal units in the developing ventral telencephalon. Furthermore, clonally related interneurons did not randomly disperse but formed spatially isolated clusters in the neocortex. Individual clonal clusters consisting of interneurons expressing the same or distinct neurochemical markers exhibited clear vertical or horizontal organization. These results suggest that the lineage relationship plays a pivotal role in the organization of inhibitory interneurons in the neocortex. |
| Value | Count | Frequency (%) |
| the | 331878 | 5.0% |
| and | 218522 | 3.3% |
| of | 206640 | 3.1% |
| to | 157856 | 2.4% |
| a | 137311 | 2.1% |
| in | 136133 | 2.1% |
| for | 74639 | 1.1% |
| is | 59943 | 0.9% |
| that | 49837 | 0.8% |
| on | 47795 | 0.7% |
| Other values (49715) | 5156906 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6511731 | ||
| e | 4129383 | 9.4% |
| t | 2938649 | 6.7% |
| a | 2753425 | 6.3% |
| i | 2718988 | 6.2% |
| o | 2630762 | 6.0% |
| n | 2626018 | 6.0% |
| r | 2309110 | 5.3% |
| s | 2224447 | 5.1% |
| l | 1375968 | 3.1% |
| Other values (173) | 13730958 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 43949439 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 6511731 | ||
| e | 4129383 | 9.4% |
| t | 2938649 | 6.7% |
| a | 2753425 | 6.3% |
| i | 2718988 | 6.2% |
| o | 2630762 | 6.0% |
| n | 2626018 | 6.0% |
| r | 2309110 | 5.3% |
| s | 2224447 | 5.1% |
| l | 1375968 | 3.1% |
| Other values (173) | 13730958 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 43949439 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 6511731 | ||
| e | 4129383 | 9.4% |
| t | 2938649 | 6.7% |
| a | 2753425 | 6.3% |
| i | 2718988 | 6.2% |
| o | 2630762 | 6.0% |
| n | 2626018 | 6.0% |
| r | 2309110 | 5.3% |
| s | 2224447 | 5.1% |
| l | 1375968 | 3.1% |
| Other values (173) | 13730958 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 43949439 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 6511731 | ||
| e | 4129383 | 9.4% |
| t | 2938649 | 6.7% |
| a | 2753425 | 6.3% |
| i | 2718988 | 6.2% |
| o | 2630762 | 6.0% |
| n | 2626018 | 6.0% |
| r | 2309110 | 5.3% |
| s | 2224447 | 5.1% |
| l | 1375968 | 3.1% |
| Other values (173) | 13730958 |
AWARD_NAME
Text
Missing
| Distinct | 15802 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 6483 |
| Missing (%) | 2.4% |
| Memory size | 2.1 MiB |
Length
| Max length | 257 |
|---|---|
| Median length | 172 |
| Mean length | 45.23419395 |
| Min length | 3 |
Unique
| Unique | 4636 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | 2019 Top 20 Digital Innovator in Benefits |
|---|---|
| 2nd row | 2019 Top 20 Digital Innovator in Benefits |
| 3rd row | 2019 Top 20 Digital Innovator in Benefits |
| 4th row | 2019 Top 20 Digital Innovator in Benefits |
| 5th row | 2019 Top 20 Digital Innovator in Benefits |
| Value | Count | Frequency (%) |
| award | 70858 | 4.1% |
| 51507 | 3.0% | |
| of | 46198 | 2.7% |
| the | 41018 | 2.4% |
| in | 33840 | 1.9% |
| for | 32090 | 1.8% |
| and | 22793 | 1.3% |
| best | 16004 | 0.9% |
| new | 14449 | 0.8% |
| top | 13211 | 0.8% |
| Other values (13425) | 1399878 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1479856 | 12.3% | |
| e | 1009011 | 8.4% |
| a | 746531 | 6.2% |
| n | 740301 | 6.1% |
| i | 736120 | 6.1% |
| r | 710027 | 5.9% |
| o | 673561 | 5.6% |
| t | 630779 | 5.2% |
| s | 448923 | 3.7% |
| l | 376048 | 3.1% |
| Other values (160) | 4507872 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12059029 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1479856 | 12.3% | |
| e | 1009011 | 8.4% |
| a | 746531 | 6.2% |
| n | 740301 | 6.1% |
| i | 736120 | 6.1% |
| r | 710027 | 5.9% |
| o | 673561 | 5.6% |
| t | 630779 | 5.2% |
| s | 448923 | 3.7% |
| l | 376048 | 3.1% |
| Other values (160) | 4507872 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12059029 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1479856 | 12.3% | |
| e | 1009011 | 8.4% |
| a | 746531 | 6.2% |
| n | 740301 | 6.1% |
| i | 736120 | 6.1% |
| r | 710027 | 5.9% |
| o | 673561 | 5.6% |
| t | 630779 | 5.2% |
| s | 448923 | 3.7% |
| l | 376048 | 3.1% |
| Other values (160) | 4507872 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12059029 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1479856 | 12.3% | |
| e | 1009011 | 8.4% |
| a | 746531 | 6.2% |
| n | 740301 | 6.1% |
| i | 736120 | 6.1% |
| r | 710027 | 5.9% |
| o | 673561 | 5.6% |
| t | 630779 | 5.2% |
| s | 448923 | 3.7% |
| l | 376048 | 3.1% |
| Other values (160) | 4507872 |
AWARD_COMPANY
Text
Missing
| Distinct | 8391 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 31679 |
| Missing (%) | 11.6% |
| Memory size | 2.1 MiB |
Length
| Max length | 246 |
|---|---|
| Median length | 130 |
| Mean length | 23.98495826 |
| Min length | 1 |
Unique
| Unique | 1975 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | Employee Benefit News |
|---|---|
| 2nd row | Employee Benefit News |
| 3rd row | Employee Benefit News |
| 4th row | Employee Benefit News |
| 5th row | Employee Benefit News |
| Value | Count | Frequency (%) |
| 60960 | 7.2% | |
| of | 47305 | 5.6% |
| university | 23904 | 2.8% |
| the | 17039 | 2.0% |
| and | 13149 | 1.6% |
| awards | 12840 | 1.5% |
| society | 10710 | 1.3% |
| association | 10655 | 1.3% |
| new | 10448 | 1.2% |
| school | 10139 | 1.2% |
| Other values (7729) | 626050 |
Most occurring characters
| Value | Count | Frequency (%) |
| 602564 | 10.4% | |
| e | 480409 | 8.3% |
| i | 415986 | 7.2% |
| o | 372568 | 6.4% |
| a | 370102 | 6.4% |
| n | 362740 | 6.3% |
| r | 320239 | 5.5% |
| t | 312795 | 5.4% |
| s | 267131 | 4.6% |
| l | 195995 | 3.4% |
| Other values (133) | 2089320 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5789849 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 602564 | 10.4% | |
| e | 480409 | 8.3% |
| i | 415986 | 7.2% |
| o | 372568 | 6.4% |
| a | 370102 | 6.4% |
| n | 362740 | 6.3% |
| r | 320239 | 5.5% |
| t | 312795 | 5.4% |
| s | 267131 | 4.6% |
| l | 195995 | 3.4% |
| Other values (133) | 2089320 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5789849 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 602564 | 10.4% | |
| e | 480409 | 8.3% |
| i | 415986 | 7.2% |
| o | 372568 | 6.4% |
| a | 370102 | 6.4% |
| n | 362740 | 6.3% |
| r | 320239 | 5.5% |
| t | 312795 | 5.4% |
| s | 267131 | 4.6% |
| l | 195995 | 3.4% |
| Other values (133) | 2089320 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5789849 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 602564 | 10.4% | |
| e | 480409 | 8.3% |
| i | 415986 | 7.2% |
| o | 372568 | 6.4% |
| a | 370102 | 6.4% |
| n | 362740 | 6.3% |
| r | 320239 | 5.5% |
| t | 312795 | 5.4% |
| s | 267131 | 4.6% |
| l | 195995 | 3.4% |
| Other values (133) | 2089320 |
AWARD_DESCRIPTION
Text
Missing
| Distinct | 8602 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 141832 |
| Missing (%) | 51.9% |
| Memory size | 2.1 MiB |
Length
| Max length | 2070 |
|---|---|
| Median length | 984 |
| Mean length | 200.8836881 |
| Min length | 2 |
Unique
| Unique | 2657 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Winner of Best Derivatives Provider, North America in 2011, 2013 |
|---|---|
| 2nd row | Winner of Best Derivatives Provider, North America in 2011, 2013 |
| 3rd row | Winner of Best Derivatives Provider, North America in 2011, 2013 |
| 4th row | Winner of Best Derivatives Provider, North America in 2011, 2013 |
| 5th row | Winner of Best Derivatives Provider, North America in 2011, 2013 |
| Value | Count | Frequency (%) |
| the | 217502 | 5.7% |
| and | 127689 | 3.3% |
| of | 119735 | 3.1% |
| in | 91789 | 2.4% |
| to | 82822 | 2.2% |
| for | 76368 | 2.0% |
| a | 66998 | 1.8% |
| is | 28370 | 0.7% |
| as | 27047 | 0.7% |
| award | 26932 | 0.7% |
| Other values (21559) | 2947559 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3693661 | ||
| e | 2387026 | 9.1% |
| t | 1699242 | 6.4% |
| a | 1641916 | 6.2% |
| i | 1623962 | 6.2% |
| n | 1577838 | 6.0% |
| o | 1533452 | 5.8% |
| r | 1407228 | 5.3% |
| s | 1295108 | 4.9% |
| d | 792504 | 3.0% |
| Other values (171) | 8712440 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 26364377 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 3693661 | ||
| e | 2387026 | 9.1% |
| t | 1699242 | 6.4% |
| a | 1641916 | 6.2% |
| i | 1623962 | 6.2% |
| n | 1577838 | 6.0% |
| o | 1533452 | 5.8% |
| r | 1407228 | 5.3% |
| s | 1295108 | 4.9% |
| d | 792504 | 3.0% |
| Other values (171) | 8712440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 26364377 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 3693661 | ||
| e | 2387026 | 9.1% |
| t | 1699242 | 6.4% |
| a | 1641916 | 6.2% |
| i | 1623962 | 6.2% |
| n | 1577838 | 6.0% |
| o | 1533452 | 5.8% |
| r | 1407228 | 5.3% |
| s | 1295108 | 4.9% |
| d | 792504 | 3.0% |
| Other values (171) | 8712440 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 26364377 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 3693661 | ||
| e | 2387026 | 9.1% |
| t | 1699242 | 6.4% |
| a | 1641916 | 6.2% |
| i | 1623962 | 6.2% |
| n | 1577838 | 6.0% |
| o | 1533452 | 5.8% |
| r | 1407228 | 5.3% |
| s | 1295108 | 4.9% |
| d | 792504 | 3.0% |
| Other values (171) | 8712440 |
AWARD_DATE
Text
Missing
| Distinct | 407 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 46076 |
| Missing (%) | 16.9% |
| Memory size | 2.1 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2019-05-01 |
|---|---|
| 2nd row | 2019-05-01 |
| 3rd row | 2019-05-01 |
| 4th row | 2019-05-01 |
| 5th row | 2019-05-01 |
| Value | Count | Frequency (%) |
| 2015-01-01 | 5714 | 2.5% |
| 2016-01-01 | 4959 | 2.2% |
| 2013-01-01 | 4814 | 2.1% |
| 2020-01-01 | 4677 | 2.1% |
| 2019-01-01 | 4080 | 1.8% |
| 2014-01-01 | 3986 | 1.8% |
| 2018-01-01 | 3775 | 1.7% |
| 2012-01-01 | 3371 | 1.5% |
| 2021-05-01 | 3370 | 1.5% |
| 2017-01-01 | 3339 | 1.5% |
| Other values (397) | 184913 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 698511 | |
| 1 | 498545 | |
| - | 453996 | |
| 2 | 328699 | |
| 9 | 57377 | 2.5% |
| 5 | 52655 | 2.3% |
| 6 | 39154 | 1.7% |
| 4 | 38211 | 1.7% |
| 8 | 38200 | 1.7% |
| 3 | 35504 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2269980 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 698511 | |
| 1 | 498545 | |
| - | 453996 | |
| 2 | 328699 | |
| 9 | 57377 | 2.5% |
| 5 | 52655 | 2.3% |
| 6 | 39154 | 1.7% |
| 4 | 38211 | 1.7% |
| 8 | 38200 | 1.7% |
| 3 | 35504 | 1.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2269980 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 698511 | |
| 1 | 498545 | |
| - | 453996 | |
| 2 | 328699 | |
| 9 | 57377 | 2.5% |
| 5 | 52655 | 2.3% |
| 6 | 39154 | 1.7% |
| 4 | 38211 | 1.7% |
| 8 | 38200 | 1.7% |
| 3 | 35504 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2269980 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 698511 | |
| 1 | 498545 | |
| - | 453996 | |
| 2 | 328699 | |
| 9 | 57377 | 2.5% |
| 5 | 52655 | 2.3% |
| 6 | 39154 | 1.7% |
| 4 | 38211 | 1.7% |
| 8 | 38200 | 1.7% |
| 3 | 35504 | 1.6% |
CITY
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York City |
|---|---|
| 2nd row | New York City |
| 3rd row | New York City |
| 4th row | New York City |
| 5th row | New York City |
| Value | Count | Frequency (%) |
| new | 273074 | |
| york | 273074 | |
| city | 273074 |
Most occurring characters
| Value | Count | Frequency (%) |
| 546148 | ||
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 | |
| C | 273074 | |
| i | 273074 | |
| Other values (2) | 546148 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3549962 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 546148 | ||
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 | |
| C | 273074 | |
| i | 273074 | |
| Other values (2) | 546148 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3549962 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 546148 | ||
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 | |
| C | 273074 | |
| i | 273074 | |
| Other values (2) | 546148 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3549962 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 546148 | ||
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 | |
| C | 273074 | |
| i | 273074 | |
| Other values (2) | 546148 |
STATE
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | New York |
|---|---|
| 2nd row | New York |
| 3rd row | New York |
| 4th row | New York |
| 5th row | New York |
| Value | Count | Frequency (%) |
| new | 273074 | |
| york | 273074 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| 273074 | ||
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2184592 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| 273074 | ||
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2184592 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| 273074 | ||
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2184592 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 273074 | |
| e | 273074 | |
| w | 273074 | |
| 273074 | ||
| Y | 273074 | |
| o | 273074 | |
| r | 273074 | |
| k | 273074 |
COUNTRY
Text
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 273074 | |
| states | 273074 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 819222 | |
| e | 546148 | |
| U | 273074 | 7.7% |
| n | 273074 | 7.7% |
| i | 273074 | 7.7% |
| d | 273074 | 7.7% |
| 273074 | 7.7% | |
| S | 273074 | 7.7% |
| a | 273074 | 7.7% |
| s | 273074 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3549962 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 819222 | |
| e | 546148 | |
| U | 273074 | 7.7% |
| n | 273074 | 7.7% |
| i | 273074 | 7.7% |
| d | 273074 | 7.7% |
| 273074 | 7.7% | |
| S | 273074 | 7.7% |
| a | 273074 | 7.7% |
| s | 273074 | 7.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3549962 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 819222 | |
| e | 546148 | |
| U | 273074 | 7.7% |
| n | 273074 | 7.7% |
| i | 273074 | 7.7% |
| d | 273074 | 7.7% |
| 273074 | 7.7% | |
| S | 273074 | 7.7% |
| a | 273074 | 7.7% |
| s | 273074 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3549962 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 819222 | |
| e | 546148 | |
| U | 273074 | 7.7% |
| n | 273074 | 7.7% |
| i | 273074 | 7.7% |
| d | 273074 | 7.7% |
| 273074 | 7.7% | |
| S | 273074 | 7.7% |
| a | 273074 | 7.7% |
| s | 273074 | 7.7% |
USER_ID
Real number (ℝ)
| Distinct | 4757 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 483499942 |
| Minimum | 1241390 |
|---|---|
| Maximum | 2225886059 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.1 MiB |
Quantile statistics
| Minimum | 1241390 |
|---|---|
| 5-th percentile | 51293060 |
| Q1 | 228217596 |
| median | 424195151 |
| Q3 | 613549428 |
| 95-th percentile | 1056527074 |
| Maximum | 2225886059 |
| Range | 2224644669 |
| Interquartile range (IQR) | 385331832 |
Descriptive statistics
| Standard deviation | 408882318.3 |
|---|---|
| Coefficient of variation (CV) | 0.8456719075 |
| Kurtosis | 7.17125062 |
| Mean | 483499942 |
| Median Absolute Deviation (MAD) | 189354277 |
| Skewness | 2.393891773 |
| Sum | 1.320312632 × 1014 |
| Variance | 1.671847502 × 1017 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 613549428 | 10759 | 3.9% |
| 295024428 | 6300 | 2.3% |
| 275633734 | 5109 | 1.9% |
| 522221259 | 4914 | 1.8% |
| 125578367 | 3864 | 1.4% |
| 536929814 | 3763 | 1.4% |
| 177348486 | 3750 | 1.4% |
| 680814974 | 3340 | 1.2% |
| 707836292 | 3232 | 1.2% |
| 363846886 | 3120 | 1.1% |
| Other values (4747) | 224923 |
| Value | Count | Frequency (%) |
| 1241390 | 12 | < 0.1% |
| 1249886 | 60 | |
| 1284328 | 40 | |
| 1490244 | 30 | |
| 1511359 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 2225886059 | 4 | |
| 2225747644 | 1 | < 0.1% |
| 2225599636 | 1 | < 0.1% |
| 2225251020 | 2 | |
| 2224246512 | 1 | < 0.1% |